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ABSTRACT 

One of the most powerful techniques to study the dark sector of the Uni¬ 
verse is weak gravitational lensing. In practice, to infer the reduced shear, 
weak lensing measures galaxy shapes, which are the consequence of both 
the intrinsic ellipticity of the sources and of the integrated gravitational 
lensing effect along the line of sight. Hence, a very large number of galaxies 
is required in order to average over their individual properties and to isolate 
the weak lensing cosmic shear signal. If this ‘shape noise’ can be reduced, 
significant advances in the power of a weak lensing surveys can be expected. 
This paper describes a general method for extracting the probability dis¬ 
tributions of parameters from catalogues of data using Voronoi cells, which 
has several applications, and has synergies with Bayesian Hierarchical mod¬ 
elling approaches. This allows us to construct a probability distribution for 
the variance of the intrinsic ellipticity as a function of galaxy property us¬ 
ing only photometric data, allowing a reduction of shape noise. As a proof 
of concept the method is applied to the CFHTLenS survey data. We use 
this approach to investigate trends of galaxy properties in the data and 
apply this to the case of weak lensing power spectra. 
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1 INTRODUCTION 


Advances in observational cosmology in the last two 
decades have indicated that the expansion of the Uni- 


verse is accelerating (e.g.[Riess et al. 

1998| Perlmutter 

et al. 

1999[ Schrabback et al.[ 2010 

[Komatsu et al. 

20TT[ 

Bennett et al.|2013[ [Planck Collaboration et al. 

2014a|b Planck Collaboration[[2015 ), rather than de- 


celerating, as previously expected. Consequently, new 
models have been proposed as revisions to our current 


cosmological model (for a review see e.g. Amendola 


et al. (2013); Clifton et al. (2012)), ranging from revi¬ 


sions of general relativity to the existence of a non-zero 
vacuum energy, generically termed “dark energy”, to 
a revision in the fundamental large-scale assumptions 
for the Universe (such as isotropy and homogene¬ 
ity, for papers discussing relaxing the assumption of 

([ 2 ^ 


isotropy, see e.g. 


Jaffe et al. 


McEwen et al. 


(2013); Planck Collaboration et al. (2014c 2015)). 
Methodologies such as weak lensing and galaxy clus¬ 
tering are being developed to place observational con¬ 
straints on these different approaches, and are becom¬ 
ing increasingly powerful, as they are explored and 
their characteristics and statistical behaviours under¬ 
stood. 

One of the most powerful techniques currently 
available is weak gravitational lensing (see e.g. iKaiser 


|1992| [Bartelmann Schneider||2001| [Albrecht et al 
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2006|[Peacock et a l.|2006||Hoekstra &: Jain|2 008l Mun- 

shi et ah] ' 2008[ Massey et al.[[2010[ Weinberg et al. 


2013[ and references therein). Through its gravita¬ 


tional effect, matter acts on light rays from a source to 
observer, and hence distorts the observed shape of the 
source; in analogy to an optical lens. Away from large 
concentrations of matter, such as the centre of galaxy 
clusters, this effect is very small: a change in the ob¬ 
served third flattening, or third eccentricity (colloqui¬ 
ally referred to as ‘ellipticity’) of an image of a galaxy 
of only a few percent, an effect known as weak lensing. 
However if large enough samples of galaxies are ob¬ 
served, with sufRcient attention to observational sys- 
tematics, then a statistical analysis of the ensemble 
can be used to infer the matter distribution along the 
line of sight. Sources at higher redshift allow the map¬ 
ping of the matter distribution at earlier times, and 
from the growth of structure and the relation of the 
amount of distortion to the expansion history of the 
Universe, inferences can be made on cosmological pa- 


2007[ [Benjamin et al.|2007 

Eu et al.|2008[ [Schrabback 

et al.|2010|[Kilbinger et al. 

2013|[Kitching et al.|2014[). 


The change in the ellipticity of galaxies result¬ 
ing from the gravitational lensing effect is known 
as ‘shear’. Ellipticity is an unbiased shear estimator, 
however galaxies are also intrinsically (in the absence 
of lensing) elliptical. However the orientation angle 
of the intrinsic ellipticity is expected to be randomly 
distributed on large scales, which allows one to re- 
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cover the shear by averaging over many galaxies. In 
more detail, because galaxies to a hrst approximation 
have a random projected orientation on the sky the 
intrinsic ellipticity acts as an additional noise term 
when attempting to measure the shear of an object: 
the observed ellipticity is a sum of shear and intrinsic 
ellipticity e = + g, where g is known as the re¬ 

duced shear. The variance on the observed ellipticity is 
cr^(e) = + cr^(^). The variance of the intrinsic 

ellipticity is the dominant term, typically by a factor 
exceeding an order of magnitude. Hence, a very large 
number of galaxies are required in order to reduce this 
by averaging over their individual properties and to 
isolate the weak leasing signal. This term is commonly 
referred to as “shape noise”, and it is by far the largest 
source of uncertainty in weak lensing measurements. 
Shape noise is a limiting fundamental noise source in 
weak lensing measurements that determines the size 
of the observational surveys being designed to mea¬ 
sure cosmological parameters using weak lensing: fu¬ 
ture state-of-the-art surveys such as that to be carried 
out by the European Space Agency’s Euclid mission 
(for details, see |Laureijs et al.||2011| |2012| [Amendola 


|et al.|20T3 and http://www.euclid-ec.org/) encom¬ 
pass most of the extragalactic sky. It follows, therefore, 
if this source of noise could be reduced, signihcant ad¬ 
vances in the power of a weak lensing survey would 
follow, or smaller and less ambitious surveys could be 
contemplated while maintaining the previous survey 
sensitivity. 

Recently Huff et al. (2013) have noted that the 
Tully-Fisher relationship could in principle be used as 
a proxy for the intrinsic ellipticity of a galaxy. However 
Tully-Fisher analyses are not practicable in a weak 
lensing context owing to the very large number of 
galaxies required, most of which are faint and hence 
have a low signal-to-noise ratio. Here we propose to 
generalise the concept to allow the incorporation of 
any other supplementary useful information that will 
provide prior knowledge on the intrinsic ellipticities, 
and which would be practicable. This could be any 
measurable property of a galaxy that does not have a 
correlation with sheai[^ In this paper we use as an ex¬ 
ample the optical magnitudes of the galaxies and their 
combination (pseudo-colours), in the expectation that 
certain galaxy types (characterised by certain colours) 
have a range of ellipticity which is different from other 
galaxy types (characterised by different colours) (for 
a related study with COSMOS sample and the Mil¬ 
lennium Simulation, in the context of intrinsic galaxy 
alignments, see Joachimi et al.||2013 ). 

What we propose is a general method to infer the 
probability distribution of the intrinsic galaxy ellip¬ 
ticity p(|e|) from a weak lensing ellipticity catalogue, 
in particular how to characterise the variance of this 
distribution. We also show how the probability of this 
distribution can be accessed, as this is particularly 


useful in the context of Bayesian hierarchical mod¬ 
elling (e.g. Schneider et al.|[2014| . Currently a Gaus¬ 
sian probability distribution for p{ei) is assumed, with 
a mean for the two ellipticity components (i) of zero 
and a standard deviation a[ei). This standard devi¬ 
ation can be derived from an observed p(|e|) from a 
calibration set (for example from higher spatial resolu¬ 
tion subsets of the data), or it can be taken from the 
dispersion in the measurement of the ellipticities of 
the galaxies. If the ellipticity measurement is carried 
out using a model-htting method, for example /ensht 
(Miller et al. 2007), the ellipticity distribution p(|e|) 
also acts as prior, which again may be from a calibra¬ 
tion set of higher resolution; as used in, for example 
the CFHTLenS analysis ( Miller et al.||2013 ). 

The structure of this paper is as follows. In Sec¬ 
tion we describe the data used and our methodology, 
and then in Section we show how the limitation of 
the resulting p(|e|) ellipticity dispersion width cr(|e|) 
can be used to more optimally weight the dataset to 
achieve increased dark energy Figure of Merit (FoM) 
( Albrecht et al.|2Q06 ). Finally, we summarise and con- 
clude in Section (4] 


2 DATA AND METHODOLOGY 

In this Section we present our general methodology. 
The central concept is that using clustering analysis 
(where the proximity of data points as a function of in¬ 
put features is estimated) of catalogue data the func¬ 
tional dependency of the variance of parameters of 
interest (in our case ellipticity of galaxies) on those 
parameters can be inferred. These approaches in fact 
already have a far more general application than that 
which we propose here, we are simply applying exist¬ 
ing machine learning methodologies in a novel cosmo¬ 
logical context. We hrst describe the data that we will 
use to develop the example of this paper, before de¬ 
scribing the general algorithm, and then the specihcs 
of the example implementation. 


2.1 Data 


The data we use are from the CFHTLenS (see e.g. 
Heymans et al. 2012) catalogue (Erben et al. |2013| ), 
which is a measurement of galaxy ellipticities (using 
the /ensht method by Miller et al. (2007)), galaxy 
photometric redshi fts [Hildebrandt et ah (2012), using 
BPZ (e.g. Benitez 2000 Coe et al. 2006)] and also 
includes in the catalogue derived meta-parameters 
such as galaxy stellar mass and bulge-to-disc ratio. 
The total data set corresponds to observations in 
hve optical bands, ugriz^ over 154 square degrees 
of sky. We use the catalogues downloaded from 
http://WWW.cade-ceda.hia-iha.nrc-enre.gc.ca/ 
community/CFHTLens/query. html , and make no cuts 
in the catalogue. 


^ In this paper we focus, as a proof of concept, on the ellip¬ 
ticity change in galaxies induced by weak lensing. The size 
and magnification of galaxies is also affected by gravita¬ 
tional lensing, but for current surveys the impact of this is 
expected to be small (see e.g. [Duncan et al.| ( [^014| )). How¬ 
ever in extensions to this study the magnification effect 
should be taken into account. 


2.2 Algorithm Overview 

The algorithm, expanded upon in detail in the fol¬ 
lowing sections, is in general terms the following. We 
assume that there are some parameters of interest (in 
our case ellipticity), and some meta-parameters that 
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may (or may not) be related to the parameters of in¬ 
terest. The objective is to determine how the prob¬ 
ability distribution of the parameters of interest is a 
function of the meta-parameters. Using a typical cat¬ 
alogue the dependency of the mean of the parameters 
is trivial to determine, but the full probability distri¬ 
bution is not. 

In summary, we adopt the following approach: 


(i) First we perform a dimensionality reduction on 
the data. This can be of any form, but is required 
to remove degeneracies and to enable computation¬ 
ally efficient subsequent steps. In our example we use 
principal component analysis (PCA; |Pearson||1901| , 
but this could be generalised for example to manifold 
learning (e.g. locally-linear embedding, isomap, etc.; 
Bengio et al. (2012)) or auto-node mapping (where 
the input is linked to the output nodes) in neural net 
analysis. At this stage a removal of the covariance of 
the parameters is also beneficial. 

(ii) Next we perform a clustering analysis on the 
resulting parameters. This clustering analysis should 
be designed to find points in the parameters that are 
‘close’ in some pre-defined distance measure (the most 
straightforward being the geometric distance between 
the points in the parameter space, but more general 
measures can be envisioned). 

(iii) Finally the statistical properties in each cluster 
cell, labelled by a combination of meta-parameters, of 
the parameters of interest can now be computed. For 
example, this could be the mean and standard devia¬ 
tion, amongst other statistics. Furthermore, probabil¬ 
ity density functions can also be derived using, for ex¬ 


ample, kernel density estimation (see e.g. Rosenblatt 


1956 ParzenQ962| Rudemo||1982 Cao et al.|1994). 


We now specify the details of the implementation 
of this approach that we take as an example in this 
paper, and apply this methodology to the CFHTLenS 
catalogue. We then apply this methodology to the case 
of reducing the shape noise term in weak lensing power 
spectrum analysis in an effort to increase the accuracy 
on the measurement of cosmological parameters for 
weak lensing analysis. 


2.3 Dimensionality Reduction 


Our chosen technique relies on identifying relations 
between photometric data and ellipticity measure¬ 
ments. The clarity of these relationships will be more 
evident if we project the five dimensional space of 
the CFHTLenS photometry (from the five bands) to 
a lower-dimensional surface within it which captures 
most of the information content. Our motivation for 
dimensionality reduction is to find those directions in 
the data that contain the majority of the informa¬ 
tion on the correlation (of more general relationship) 
between variables. Hence we perform a dimensional¬ 
ity reduction using PCA ( Pearson]|1901 ), which uses 
an orthogonal transformation to convert a set of ob¬ 
servations of possibly correlated variables into a set 
of values of maximally linearly uncorrelated orthogo¬ 
nal variables called principal components. Before per¬ 
forming the dimensionality reduction with PCA we 


whiter0 the data allowing to project the data onto 
the singular space while scaling each component to 
unit variance. This is useful because the clustering al¬ 
gorithm we adopt below makes assumptions on the 
isotropy of the signal. In the analyses we use the Ran- 
domizedPCA (Halko et al. 2009) implementation of 
the Scikit Learn package (Pedregosa et al. 2011). In 
this process, the linear combinations of magnitudes 
effectively produces a colour-colour hyperspace. 

Figure shows the CFHTLenS photometry pro¬ 
jected onto the two dimensions with the highest eigen¬ 
value principal components. Jointly the two first prin¬ 
cipal components explain approximately 96 per cent 
of the variance in the original CFHTLenS photometry. 
The PCA components with the two highest eigenval¬ 
ues are the following: 


/ 0.20 0.22 0.25 0.25 0.25 

0.96 0.49 -0.08 -0.45 -0.67 


u 

9 


\ 

•( 1 ) 

/ 


Thus, PCA^l can be interpreted to represent an av¬ 
eraged broad band (‘white light’) magnitude, while 
PCA#2 amounts to a colour term ‘blue — red’, where 
the blue part is a combination of u and g and the red 
is comprised mostly of i and z. 

In Figure we colour-code each of the 5770490 
galaxies with the modulus of their ellipticity |e|, cal¬ 
culated from the ellipticity components ei as |e| = 
+ e|, measured in the CFHTLenS survey. Re¬ 
gions with lower mean ellipticity, broadly, those with 
negative values in the first two principal components 
are evident. However, to quantify the mean elliptici- 
ties of galaxies in the different regions, we must create 
samples which have similar values in the PCA^l - 
PCA#2 plane. 


2.4 Clustering 


While the simplest grouping of objects can be achieved 
by simply binning in multiple dimensions (by group¬ 
ing objects based on their positions), this is not op¬ 
timal when the object density varies in the PCA#1 
- PCA#2 plane, because some cells are more densely 
populated while others are sparse. It would be better 
to divide the space into regions with similar photo¬ 
metric parameter ranges. This can be done in a num¬ 
ber of different ways, for example by a self-organising 
map ( Kohonen||1982|. Her e we instead adopt a k- 
means (e.g. |Steinhaus|1957| MacQueen|1967 ) cluster¬ 
ing method (abbreviated to ‘k-means’), because self- 
organising maps can be viewed as a constrained ver¬ 
sion of k-means clustering ( [Hastie et al.|2001 ) and we 
aim to propose a general methodology. 


It can be shown (see e.g. Zha et al. 2001 Ding 
& He 2004) that PCA automatically projects to a 


^ A whitening changes an input vector into a white noise 
vector by transforming a set of random variables with a 
known covanriance matrix into a set of new random vari¬ 
ables which are uncorrelated and all have variance equal to 
unity. In practice, the vectors are divided by the singular 
values. 
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Figure 1. CFHTLenS photometry and colours projected 
onto two dimensions using dimensionality reduction by the 
PCA method. Each galaxy has been colour coded by its 
measured ellipticity |e|. A low value of PCA#1 corresponds 
to high apparent brightness. 


Figure 2. Clusters identified by k-means algorithm from 
the PCA projected data. The projected PCA plane has 
been partitioned to 1700 Voronoi cells. A low value of 
PCA#1 corresponds to high apparent brightness. 


subspace where the global solution of k-means clus¬ 
tering lies, and thus facilitates k-means clustering to 
find near-optimal solutions. It therefore is natural to 
adopt k-means, over self-organised maps, to identify 
regions of similar feature qualities in the projected 
space, k-means aims to partition the galaxies into k 
clusters in which each galaxy belongs to the cluster 
with the smallest sum of squares distance in the fea¬ 
ture space. This results in a partitioning of the pro¬ 
jected data space into Voronoi cells. In the analyses 
we use the MiniBatchKMeans implementation of the 
Scikit Learn package ( Pedregosa et al.||2011| ). 

Figure shows the results of k-means clustering 
when the galaxies are grouped into (^^5770490/2) ~ 
1700 clusters. Because there are more galaxies towards 
the centre of the PCA plane, the Voronoi cells there 
are smaller compared to the cells around the edges. 
This achieves the objective of a more even number 
of galaxies in each cell and hence smaller mean-level 
differences between the cells. 


2.5 Statistics of Parameters of Interest: 
Ellipticities 

Now that we have implemented the algorithm de¬ 
scribed above we can begin to investigate the be¬ 
haviour of the parameters of interest, in our case the 
ellipticities of the galaxies used in the weak leasing 
analysis. In the following the mean and the standard 
deviation of the ellipticities refer to the weighted ver¬ 
sions of these quantities: 

^ ( 2 ) 

ZjWi 

. (3) 

Zj'^i 

The weight Wi for a given galaxy i is taken directly 
from the CFHTLenS catalogue and refers to the in¬ 
verse variance as calculated by the /ensfit algorithm. 


{ei) = 

o-(ei) = 


5. i Mean 


The left panel of Figure shows the mean of the mod¬ 
ulus of ellipticity in each of the 1700 Voronoi cells 
in Figure [^identified by the k-means algorithm from 
the PCA projected data. The colour coding shows low 
and high mean ellipticity regions with blue and red, 
respectively. As a sanity check of the data we also 
calculated the mean of each of the ellipticity compo¬ 
nent (ei) and ( 62 ), which should be consistent with 
zero in the absence of systematic effects. These were 
found to be consistent with zero (all Voronoi cells have 
-0.015 ^ (ei) ^ 0.015). 

It is remarkable how strongly the average of the 
modulus of the ellipticity (|e|) is correlated with the 
PCA^l component of the photometry, with a lesser 
dependence on PCA#2. Given that PCA#1 relates 
to the visual brightness of a galaxy, this can be ex¬ 
plained by three observations. Firstly, at lower red- 
shifts (z < 1) intrinsically brighter objects are likelier 
to be more circular from galaxy evolution arguments 


(e.g. van den Bergh et al.|1996 Conselice et al.| 2000 ) 
such objects are generally elliptical galaxies instead of 
spirals, and ellipticals on average have a lower mea¬ 
sured |e|. Secondly, it is also possible that the galaxies 
in the low mean ellipticity Voronoi cells are mostly 
bright face on spiral galaxies, which tend to be less el¬ 
liptical and visually brighter than more elliptical edge 
on spirals, which may suffer from substantial dust at¬ 
tenuation in optical wavelengths. In addition, a third 
effect will arise from what has been termed as ‘noise 
bias’ ( Viola et al.|2014 and references therein) in the 
weak leasing community. That is, the measured ellip¬ 
ticity of any object is more uncertain (and in general 
biased towards zero) as the noise level in an image in¬ 
creases. In practice, it is likely that all three competing 
effects contribute. 


^.5.^ Standard Deviation 

The right panel of Figure shows the standard devi¬ 
ation of the ellipticities of the galaxies in each of the 
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Figure 3. Left: the mean ellipticity {|e|} in each of the 1700 Voronoi cells identified by k-means algorithm from the PCA 
projected data. Right: the standard deviation of modulus of ellipticity cr(|e|) of the galaxies in each of the 1700 cells 
identified by k-means algorithm from the PCA projected data. Three regions of interest, named “A”, “B”, and “C”, have 
been highlighted. A low value of PCA^l corresponds to high apparent brightness. 


1700 Voronoi cells. The figure shows that the region 
with the highest cr(|e|) is at the centre of the projected 
parameter space and also at the top left corner. Inter¬ 
estingly, the highest mean ellipticity does not corre¬ 
spond to the highest standard deviation regions in the 
colour-colour principal components plane. The highest 
ellipticity variance corresponding to PCA^l values of 
~ 0 is a significant feature, rather than simply a noise 
artefact, as we explain below. 

In some cases the high standard deviations could 
result from larger measurement uncertainties, rather 
than the intrinsic dispersion. However, in the follow¬ 
ing we show that the standard deviations over a large 
part of the PCA plane, especially at high PCA^l 
values with the faintest galaxies, are not dominated 
by measurement errors, but rather the variations in 
the measured galaxy ellipticities. While the elliptic¬ 
ity measurement for an individual galaxy becomes 
more uncertain as we move along the PCA^l, sim¬ 
ply because the signal-to-noise ratio decreases as the 
PCA #1 increases, we do not observe the highest dis¬ 
persions with the highest PCA #1 values. While the 
noise contribution is a monotonic function of PCA^l, 
its contribution should not exceed the variance of the 
cells at a given PCA^l. For all Voronoi cells with 
the centre of their PCA#1 greater than 1.3, we find 
the standard deviation to be 0.004 in the ensemble 
of cells for cr(|e|) compared to the typical values of 
cr(|e|) of ~ 0.14. Hence, even at the faintest PCA#1 
the broadening by measurement noise is not domi¬ 
nant, and it is even less so in all other cells, by thus 
setting an upper limit for the broadening by measure¬ 
ment noise. To demonstrate this further we have also 
repeated the analysis at different signal-to-noise ratio 
slices. The main findings of the paper are largely in¬ 
dependent from the selected SNR. Most importantly, 
the cell statistics remain unchanged when adopting a 
different SNR cut, only the number of cells populated 
by galaxies change, as is evident from Figure In 
the three panels, the SNR selections range from > 10, 
> 20, and > 100. For the cells that are well popu¬ 
lated, the mean ellipticity and the standard deviation 
remain unchanged. 


While the right panel of Figure shows the stan¬ 
dard deviation of the modulus of the ellipticity, how¬ 
ever, for weak lensing analysis the standard devia¬ 
tions of each of the ellipticity components is also of 
interest. We therefore show the standard deviations 
for both components separately in Figure It is evi¬ 
dent from the Figure that these agree well with each 
other and also, to some extent, with the mean of 
the modulus of the ellipticity (the right panel of Fig¬ 
ure . At high signal to noise we find a similar be¬ 
haviour, whereas at low signal to noise we find some 
deviation, where the individual-component variance 
rises, where the modulus’ variance decreases. This re¬ 
sults from two effects that are more pronounced at 
low signal to noise. Firstly, the standard deviation of 
the modulus (which is always positive semi-definite), 
which is approximately a Rayleigh distribution over 
0 < I e I < 1 , is smaller than the standard deviation 
that one would compute from a Gaussian over the in¬ 
terval — 1 < < 1 , the relation being a factor of 

(4 — 7 r)/ 2 . Secondly ei and 62 are correlated, thus the 
standard deviation of the individual components is not 
expected to fully trace the variance of the modulus. 
In the remainder of this paper we will investigate the 
modulus although these considerations should be kept 
in mind in the interpretation of the results. 

We can also identify broad regions of interest in 
the PCA plane, these are named “A”, “B”, and “C”, 
in Figure Is] The “A” region, located at high (> 3.5) 
PCA #2 and intermediate (<0.5) PCA #1 values, con¬ 
tains Voronoi cells with low or average mean elliptic¬ 
ity, but medium to high standard deviation. Instead, 
the region “B”, found with the smallest PCA val¬ 
ues (PCA#1 < —3.5 and PCA#2 < 0.2), contains 
Voronoi cells with the lowest mean ellipticity and stan¬ 
dard deviation. This is especially interesting for weak 
lensing. The third region, named “C”, can be found 
with the smallest (< —1.5) PCA^2 and intermedi¬ 
ate (—2 < PCA #1 < 1 ) PCA #1 values, and contains 
Voronoi cells with the largest mean ellipticity and in¬ 
termediate standard deviation. It will be interesting 
to see what types of galaxies reside in these cells. Fi¬ 
nally, the rest of the area is covered mostly by cells 
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Figure 4. The standard deviation of modulus of ellipticity (T(|e|) of the galaxies in each of the 1700 cells identified by 
k-means algorithm from the PCA projected data with different signal-to-noise ratio cuts Left: All galaxies with SNR > 10. 
Middle: All galaxies with SNR > 20. Right: All galaxies with SNR > 100. A low value of PCA^l corresponds to high 
apparent brightness and a low value of PCA#2 to a bluer galaxy colour. 
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Figure 5. Left: the standard deviation of the first ellipticity component cr(ei) of the galaxies in each of the 1700 cells 
identified by k-means algorithm from the PCA projected data. Right: the standard deviation of the second ellipticity 
component a{e 2 ) of the galaxies in each of the 1700 cells identified by k-means algorithm from the PCA projected data. A 
low value of PCA#1 corresponds to high apparent brightness and a low value of PCA#2 to a bluer galaxy colour. 


with either large mean or large standard deviation. 
We will return to these regions in Section [2^ 


2.5.3 Prohahility Distributions of the Cells 

Figure shows the probability density functions 
(PDFs) for ellipticity derived from all galaxies in a 
given cell for three different Voronoi cells, correspond¬ 
ing to the three regions described above. These are 
representative examples. The left panel of Fig.j^shows 
the ellipticity PDF of cell number 38 in Region A, 
which contains mostly very massive elliptical galaxies 
at intermediate redshifts. It is clear that the Voronoi 
cell must also contain some disk or iSO-galaxies that 
are inclined because the tail of the PDF extends to 
higher ellipticities. The middle panel shows the el¬ 
lipticity PDF of cell number 206 in region B, which 
contains mostly face-on disk or A^O-galaxies at rela¬ 
tively low redshift, peaks at low ellipticity values with 
a skewness to higher ellipticity values. This will lead 
to a low mean cluster ellipticity and a relatively small 
variance. The right panel of Fig. shows the ellip¬ 
ticity PDF of cell number 178 in Region C, contain¬ 
ing mostly inclined disk galaxies at high redshift. The 
peak of the PDF has now shifted to intermediate ellip¬ 
ticities and the distribution is rather broad leading to 


a relatively high variance. These panels can be com¬ 
pared to Figure 7 of Joachimi et al. (2013). In general, 
we find a good agreement. For example, the top panel 
of Figure 7 of Joachimi et al.| ( |2013 ) showing the PDF 
for late-type galaxies is in good qualitative agreement 
with the right panel of Fig. 


2 . 5.4 Prohahility Distributions 

The left panel of Figure shows the ellipticity prob¬ 
ability density function derived from all 1700 Voronoi 
cells in Figure while the right panel of Figure 
shows the probability density function of the stan¬ 
dard deviation of the ellipticities p(cr(|e|)). Note that 
these PDFs (from all cells) are different from those in 
Fig. [^(constructed from galaxies within a cell). By se¬ 
lecting galaxies with particular projected PCA values, 
we can select the appropriate p(cr(|e|)). In the follow¬ 
ing Sections we apply this to the case of a weak leas¬ 
ing power spectrum analysis. For example, the <7a{\e\) 
can be tailored to select those particular galaxies (de¬ 
noted here with a) that may be narrower than the 
all-encompassing cr(|e|) for all galaxies. Therefore the 
shape noise in the power spectrum calculations can be 
reduced. This will be pursued in Section 
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Figure 6. Left: Probability density function for the weighted ellipticity from the galaxies of cell 38 containing mostly very 
massive elliptical galaxies at intermediate redshifts (region “A”). Middle: Probability density function for the weighted 
ellipticity from the galaxies of cell 206 containing mostly face-on disk or ^SO-galaxies at relatively low redshift (region “B”). 
Right: Probability density function for the weighted ellipticity from the galaxies of cell 178 containing mostly inclined disk 
galaxies at high redshift (region “C”). 




Figure 7. Left: probability density function for the weighted mean ellipticity |e| from the clustered data with 1700 individual 
Voronoi cells. Right: probability density function for the weighted standard deviation of the ellipticity cr(|e|) from the 
clustered data with 1700 individual cells. 


2.6 Redshift 

The weak lensing power spectrum must be generated 
at different redshifts to enable weak lensing tomogra¬ 
phy, as this requires several redshifts to measure the 
dark energy equation of state (dark energy being an ef¬ 
fect that changes the expansion history of the Universe 
as a function of redshift). It is therefore interesting to 
look at the redshift distribution of galaxies measured 
in the CFHTLenS survey in our k-means Voronoi cells. 
We use the probability density function estimates (full 
posterior redshift information) from CFHTLenS cat¬ 
alogues, but visualise the point estimate of the PDF. 
The point estimates presented are the peaks of the 
PDFs. 


2.7 Galaxy Properties 

In the previous sections we have looked at the average 
properties of the galaxies in the Voronoi cells, which 
are relevant for weak gravitational lensing. However, 
because of the clear trends in the average ellipticities 
as shown by the left panel of Figure it is interesting 
to look also at the astrophysical properties of these 
galaxies. There are many parameters that describe 
galaxy morphology and astrophysical properties, here 
we chose one that was derived from the CFHTLenS 
ellipticity measurements (using /ensfit; the bulge-to- 
disc ratio) and one that was derived from photomet¬ 
ric measurements (the stellar mass) for comparison. 
However this approach could be generalised to any 
property listed in a catalogue. 


Figure shows the mean and the standard devia¬ 
tion of the redshift of the galaxies in each of the 1700 
cells. The figure shows that the highest and the lowest 
redshift galaxies are separated in different regions in 
the plane, the lowest average redshift cells favouring 
the most negative PCA#1 values, while the highest 
redshift cells can be found with the smallest PCA#2 
values. Over a large number of cells the variance be¬ 
tween the photometric redshifts of the galaxies in each 
cell is fairly low, as shown by the right panel of Figure 

E 


2.7.1 Bulge-to-Disc Ratios 


A fundamental galaxy property is the bulge-to-disc 
ratio, which can be used as a proxy for the morpho¬ 


logical type of a galaxy (e.g. Simien & de Vaucouleurs 
1986| . Figure]^ shows the mean and the standard de¬ 


viation of the bulge fractions of the galaxies measured 
in the CFHTLenS (column BULGE_FRACTION in 
the CFHTLenS catalogue) in each of the 1700 k-means 
clusters. A clear trend can be observed; the smaller the 
PCA^l and higher the PCA#2 value, the higher the 
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PCA #1 PCA #1 


Figure 8. Left: the mean redshift {z) in each of the 1700 Voronoi cells identified by k-means algorithm from the PCA 
projected data. Right: the standard deviation of the redshift (t{z) of the galaxies in each of the 1700 Voronoi cells identified 
by k-means algorithm from the PCA projected data. A low value of PCA#1 corresponds to high apparent brightness. 


average bulge fraction. Thus, galaxies in these Voronoi 
cells are more likely to be bulge dominated elliptical 
galaxies. If this is true, there should also be a similar 
trend in the average stellar masses. 


2.1.2 Stellar Masses 

For the stellar masses we use the LPJoglOSM^MED 
column of the CFHTLenS catalogue, which contains 
the median estimate for the stellar mass. The miss¬ 
ing values, indicated by —99, were omitted from the 
calculations. Figure shows the mean and the stan¬ 
dard deviation of the stellar masses of the galaxies 
in each of the 1700 k-means Voronoi cells. A clear 
trend along the PCA#2 can be observed: higher values 
of the PCA component correspond to average higher 
stellar masses, which is as expected because stellar 
mass is related to galaxy colour. This agrees with the 
interpretation of the bulge-to-disc ratio results for the 
location of elliptical galaxies in the PCA plane. 


2.8 Regions of Interest 


The three regions, “A”, “B”, and “C”, identified ear¬ 
lier can now be linked to different galaxy types. For 
example, the Voronoi cells in region “A”, that is the 
highest (> 3.5) PCA#2 and intermediate (< 0.5) 
PCA#1 values, contain very massive elliptical galax¬ 
ies, which reside on average at an intermediate red- 
shifts {{z) ~ 0.7). Region “B”, with the smallest PCA 
values (PCA#1 < —3.5 and PCA#2 <0.2), contains 


cells with mostly relatively low mass (see Fig. 10) 
face-on disk or 5'0-galaxies (Fig. who reside rel¬ 
ative nearby (low redshift, see Fig. |^. The highest 
redshift {{z) ~ 2) disk galaxies, which are also often 
inclined, can be found in region “C” with the smallest 
(< —1.5) PCA#2 and intermediate (—2 < PCA^l < 
ij PCA#1 values. 


3 APPLICATION: IMPROVED 
PARAMETER INFERENCE 

3.1 Context 

Once the distribution functions of the statistical prop¬ 
erties of parameters of interest are available as a func¬ 
tion of meta-parameters, these can then be used to im¬ 
prove the inferences made on model parameters that 
one may wish to fit to the data. This approach is 
tied very closely to the approach of Bayesian Hier¬ 


archical modelling (e.g. Schneider et al. 2014), and 
indeed the distribution functions uncovered using our 
approach can be used as initial input into a Bayesian 
network where prior probability distributions of meta¬ 
parameters and parameters of interest are required in 
order to efficiently sample the joint likelihood space 
investigated. When including a full model of the data 
in such an approach, the aggregated gains commonly 
lead to improved parameter precision and accuracy 
(as has been seen in a supernovae application in cos¬ 
mology, see e.g. Mandel et al. (2009 2011)). 

In this paper we apply the distribution of the 
standard deviation of ehipticity in a more simple, but 
also pedagogical, manner where we extend the weak 
leasing power spectrum formalism to include such a 
distribution and assess the change in expected cosmo¬ 
logical parameter performance derived from such an 
analysis. 


3.2 Generalised Cosmic Shear Power Spectra 


Previous cosmic shear analyses use a single popula¬ 


tion of galaxies (or split into two samples e.g. Hey- 
mans et al. (2013)), after selections are made at the 
shape measurement stage, and a single ehipticity vari¬ 
ance calculated and applied to calculate a single cos¬ 
mic shear power spectrum. This would correspond to 
a delta-function distribution in the right panel of Fig¬ 
ure [3 In our case the data can be subdivided into 
several sub-populations, each with a different number 
density na{z) and ehipticity variance cr^(|e|) (hence¬ 
forth shortened to cTe^a)? these can be combined in a 
more optimal way. 

The observed tomographic cosmic shear power 
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PCA #1 PCA #1 


Figure 9. Left: the mean bulge fraction in each of the 1700 Voronoi cells identified by k-means algorithm from the PCA 
projected data. Right: the standard deviation of the bulge fraction of the galaxies in each of the 1700 Voronoi cells identified 
by k-means algorithm from the PCA projected data. A low value of PCA#1 corresponds to high apparent brightness. 



PCA #1 PCA #1 


Figure 10. Left: the mean stellar mass {\ogiQ(Ms [AIq])) in each of the 1700 Voronoi cells identified by k-means algorithm 
from the PCA projected data. Right: the standard deviation of the stellar mass a{\ogiQ{Ms [^o])) of the galaxies in each 
of the 1700 Voronoi cells identified by k-means algorithm from the PCA projected data. A low value of PCA^l corresponds 
to high apparent brightness. 


spectrum (see |Hu 1999) can be written as a sum of 
a signal and noise term 

Cij^i = Cij^i + Nij^i , (4) 

where ij refer to redshift bin pairs (such that i = j 
refers to intra-bin power or ‘auto-correlation’, and i 
j refers to inter-bin power or ‘cross-correlation’), and £ 
refers to an angular wavenumber. We refer the reader 
to Kitching et al. (2011) where this is derived from a 


spherical harmonic-spherical Bessel representation of 
the full three-dimensional shear field. 

The signal part of the power spectrum is a 
function of angular diameter distances of the source 
redshift, and a function of the matter power spectrum 
as a function of redshift (or comoving distance): 


= A 




a(r'y 'v-z-’-i-J/ ■ (5) 

The constant A = (3QMi^o/2c^)^, where Qm is the 
dimensionless matter density, iLo is the current value 
of the Hubble parameter, zh is the redshift of the 
cosmic horizon, c is the speed of light in a vacuum, 
a(r) is the dimensionless scale factor, P(£/r;r(z)) is 


the power spectrum of matter perturbations, and r(z) 
is a comoving distance. The weight function is 

rr(zH) 


w(r) = 


dr' p(r'\r[zi\) 


fK(r' - r) 
fKir') 


(6) 


where /ic(r) = sinh(r), r, sin(r) for curvatures of 
K = —1, 0, 1, and p(r' | r) is the probability that a 
galaxy with comoving distance r is observed at dis¬ 
tance This representation of the shear power spec¬ 
trum assumes the Limber approximation, a spherical 
Bessel transform to comoving distance, and a binning 
in redshift; for a derivation of this from the full 3D cos¬ 
mic shear power spectrum see Kitching et al. (2011). 

The noise part of the power spectrum is 






(7) 


where is the ellipticity variance, 6 ^ is a Kroneger 
delta-function, and Nu is the mean number of galax¬ 
ies in redshift bin i. To minimise the noise contribu¬ 
tion, one should aim to maximise the mean number of 
galaxies, while minimising the ellipticity variance. 

For each Win galaxy populations, each with a dif- 
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ferent number density and ellipticity variance, derived 
using the process described in the previous Section, 
the power spectra can be combined using an inverse- 
variance weighted sum 


Cij^i — 


SXlMa) 


(8) 


where is the power spectrum computed for the 

population a, that has a number density na{z) and 
an ellipticity variance (Jg q,. We assume here that each 
population is subdivided into the same redshift bin 
set, although this can in principle also change between 
the populations. 


3.3 Applying the Inverse Weighting 


Here we present a prediction for how such weighting 
will impact cosmological parameter estimation using 
tomographic cosmic shear. To compute expected cos¬ 
mological parameter errors we use the Fisher matrix 
formalism present in Hu ( 1999| for cosmic shear to¬ 
mography. This results in a matrix Fh/3 (the Greek 
letters denote cosmological parameters) where the 
is a vector of expected, marginalised, 
cosmological parameter uncertainties. We use a Cold 
Dark Matter (CDM) cosmology with a varying dark 
energy equation of state, where the free parameters 
that we use are Dm, Db, erg, ico, rca, h, ris (respec¬ 
tively, the dimensionless matter density; dimensionless 
baryon density; the amplitude of matter fluctuations 
on 8Mpc scales - a normalisation of the power spec¬ 
trum of matter perturbations; the dark energy equa¬ 
tion of state parametrised hy w{z) = WQ-\-WaZ/(I z)] 

the Hubble parameter h = i/o/100kms“^Mpc“^; and 
the scalar spectral index of initial matter perturba¬ 
tions). For each parameter we use the Planck max¬ 
imum likelihood values (Planck Collaboration et al. 
[2014 a|b| about which to take derivative of the power 
spectra for the Fisher matrix. All parameter errors 
and biases we quote are marginalised over all other 
parameters in this set. We use the camb sources code 
(see http://camb. info/sources/ and [Seljak Zah 


darriaga|1996| [Lewis fc Bridle|200^|Challinor fc Lewis 


201 1| to compute the cosmic shear tomographic power 
spectrej^ We use a maximum radial wavenumber of 
^max = 10/iMpc“^ and a corresponding redshift- 
dependent maximum Cmode of ^ = kma.^r[z]. 

The cosmic shear survey we assume is a Euclid- 
like experiment that has an area of 15000 square de¬ 
grees, a median redshift of Zm = 0.9, a number den¬ 
sity of 30 galaxies per square areminute with a num¬ 


ber density distribution n{z) given in Taylor et al. 
(2007), and a photometric redshift distribution that is 


assumed to be Gaussian with a standard deviation of 
a{z) = 0.05(1 + z). These characteristic are described 


Laureijs et al. (2011) 


To investigate the expected effect we first look at 
some simple model examples to illustrate the essential 
aspects of the information change that occurs when 
including a more general distribution of the p{cr). We 
use the probability distribution of the variance shown 


^ Code to reproduce the results of this paper is available 
on request. 



Figure 11. The predicted marginalised 2-parameter Icr 
uncertainty ellipsoids in the {wq, Wa) plane with and with¬ 
out the inverse-variance weighting, for a Euclid-\\ke exper¬ 
iment in case half of the galaxies have half the ellipticity 
standard deviation. 


in the right panel of Figure and assume that the 
number density distribution as a function of redshift 
is the same for each population. We compare with the 
standard case, where the mean of the variance is taken 
from the distribution in the right panel of Figurej^and 
a single population is assumed. We find that the in¬ 
clusion of the inverse-variance weighting, in this case, 
improves the predicted dark en ergy Figure-of-Merit 
(l/(cr(ieo)cr(iea) — [cr^(ieo, rca)])) (Albrecht et al. 2006) 
by a modest two per cent. However, this improvement 
demonstrates the principle of the improvement that 
may be gained, and this will be more significant when 
several such meta-parameters of the data are com¬ 
bined. 

To demonstrate the principle further, we show in 
Figure 11 the {wq^Wo) predicted projected la uncer¬ 
tainty contours for the case in which no Ue weight¬ 
ing is used and the case in which half of the galax¬ 
ies have a <7e which is half the nominal case i.e. the 
distribution is a top hat function from 0.15 to 0.30. 
It is clear from this Figure that if additional meta¬ 
parameters can be found, such that Voronoi cells with 
small ellipticity variance can be identified, significant 
gains can be made, and this would in fact become the 
dominant consideration in supplementing weak leas¬ 
ing surveys for dark energy science. Using supplemen¬ 
tary data such as that discussed in Huff et al. (2013) 
there remains the possibility of finding such a low-ae 
measure or population of galaxies. 


4 CONCLUSION 

In this paper we present a method that can be used 
to find how the probability distributions of parame¬ 
ters of interest depend on other, meta-parameters in 
the data. This uses a dimensionality reduction and 
noise whitening, followed by a clustering analysis. We 
use this to investigate how the probability distribu¬ 
tions of galaxy ellipticity depend on photometry of 
galaxies. In particular we construct a probability dis¬ 
tribution for the intrinsic standard deviation of the 
galaxy ellipticities p(a(|e|)). 

We apply this to the investigation of galaxy prop- 
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erties and find that such an approach is particularly 
good at isolating galaxy type: where regions in the 
clustering analysis correspond to early and late type 
galaxies. Finally, we apply this to the case of improv¬ 
ing cosmological parameter inference using weak leas¬ 
ing and find a few percent improvement in dark energy 
measurements even in this simple application. 

The method we use here has several applications 
as a tool for cosmology, and has synergies with a full 
Bayesian hierarchical modelling approach. 
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